How long does it take a society to learn a new term? 
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In this paper, I study the diffusion of new terms, called neologism, in social networks. I consider 
it as an example of information dynamics on networks and I hope that solving this problem can 
help us to understand and describe the information dynamics problem. To do so I develop a 
phenomenological model for the diffusion mechanism. I find an analytical relationship between 
number of people in the society who has learned the term and time taken. The Network parameters 
are imported in this analytical solution. I also present some simulation for this mechanism for 
several sample and some real networks which confirms the analytical results. In addition, I study 
the effects of network topology on diffusion process. 
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I. INTRODUCTION 

Recently, attempts to describe and understand the 
status and different behaviors of a society have been 
the subject of intense research. The structure of social 
networks has been studied and simulated by different 
types of random graphs such as scale free and small 
world graphsflj. Some methods were developed to 
identify communities in social networks @ , [3|] , [J] , [y] , 
and centrality in networks Q. There is no need to talk 
about its importance because it is easy to realize the 
effect of such an understanding on our life. 

Besides these attempts to study the societies, some 
new researches were developed to study phenomena 
that occur in the societies. One of the most impor- 
tant phenomena in the society is that when a piece of 
information enters the society, some people get it and 
then it starts to spread through out the network. The 
piece of information may be a gossip, a virus, fame 
of a film or a person and etc. This phenomenon is 
called information spread or information dynamics on 
networks. 

The information spread has been studied in sev- 
eral cases. For instance, the problem of gossip spread 
in social networks has been studied by Lind et al @. 
Similar researches were devoted to the dynamic of rep- 
utation [9|] and the diffusion of viruses [lfj , disease pdj 
and computer viruses [12|. 

In this work, I go through the spread of a new term 
in a society. Indeed, the main question I want answer 
is that, when a new scientific term or expression is 
produced, how does it spread through the scientific 
societies, but in this paper, I have studied and solved 
the general problem of the spread of any term in any 
social network. 

To do this, first I used graphs as a mathematical 
tool to describe the society in my model. Second, I 
present a phenomenological model to illustrate the dif- 
fusion mechanism where by phenomenological I mean 
that this model is mostly based on our intuitional un- 
derstanding of this phenomenon. In other words, in 
this model, as we may expect, the term is generated 
by someone out of the society and spreads through 
people's conversations. 



Here, I derive an analytical relationship between the 
number of people who have learnt the term and time 
taken. One of the key point in this job is that this 
relationship matches our expectations. In addition, I 
developed some simulations for this model which con- 
firm the analytical results. 

The structure of this paper is as follows: in sec- 
tion (JTTJ) , I introduce my problem and my model. In 
section (|III[) I present my analytical solution for term 
spread in society. In section (jWJ I explain some de- 
tails about the simulation that I developed for this 
problem. Then some of the key results are extracted 
in section (jV{. Finally, some concluding remarks are 
made in Section (I VII). 
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II. THE MODEL 

Here I describe my model and explain the main con- 
cepts, assumptions and my notation. 

I model the society as a graph G(N,E) where ver- 
tices N represent people and edges E represent their 
relationships. In other words a society is a set of ver- 
tices which are connected to each other if and only if 
corresponding persons are related to each other. This 
relation might be family relationship, friendship, a sci- 
entific cooperation, etc. Here, I use simple graphs for 
simplicity; directed and weighted graphs can be used 
in appropriate situations. For example if people's rela- 
tionships are not the same, I can use weighted graphs. 

On the other hand, I have to model the spread 
mechanism. The main idea in my model is that some- 
one learns the new term and then teaches it to those 
who he or she has a relation with. In this section I 
explain the implementation of this idea. 

To do this, I first need to implement the individual 
learning process because the society learning is based 
on it. As in a real situation, each person needs to 
hear the term several times to learn it. Indeed, ev- 
ery person hears the term time after time during the 
learning process, but he cannot use it unless the num- 
ber of times he has heard the term, di , passes a specific 
number of times which I call the learning threshold Di 
and depends on his abilities. The threshold, Di is one 
of the model's parameters. 

I also divide the society learning process into 
smaller processes. Indeed my model is based on re- 
peating a learning step which I call game Fig. (El). In 
each game, someone is chosen randomly and is called 



god tellj 
Neo? 



* 



Has Neo 

learnt? 



\ 



Tells the 
neighbors 




FIG. 1: stepl) god chooses Neo randomly step2) if Neo learns the term, he teaches it to his friends and the game is 
finished, otherwise the game is finished 



Neo. Neo hears the term and d^eo increases by one. 
If he learns the term, he would tell it to his friends. 
So his friends hear the term once. Otherwise, if Neo 
does not learn the term, he cannot use it in his con- 
versations. After Neo's action the game finishes. 

In each game, some people hear the term and their d 
increases by one and after several games their hearing 
times reaches the threshold D and they learn the term. 
In other words, in each game some people may learn 
the term also it may happen that no new person learns 
it. Repeating the game, the number of learnt people 
increases and approaches the total number of people 
in the society, N. 

In brief, the main characteristics of game are : 

god the one out side the game who knows the term 



and chooses the Neo and tells the term to him. 

Neo the one who is randomly chosen in each game by 
god. 

learner the one who has not learnt the term yet and 
d < D for him. 

learnt the one who has learnt the term and who can 
teach it to his friends. 

Also, there are some assumptions in my model 
which are as follows : 

1. The society network is static. In fact I assumed 
that people and their relationship are fixed when 
they learn term. Because the time scale of so- 
ciety dynamics seems to be negligible in com- 



parison with learning process time scale, my as- 
sumption is still valid. 

2. There is a god out side the society who knows 
the term and chooses Neo and as an external 
force, tells him the term. In a real society, 
different choices such as multimedia, language 
academies, etc can play it's role. 

3. The leaning process for all terms are the same. 

Having defined the model, I explain my approach 
to use this model to illustrate the learning process. 

I first need to quantify the learning process and 
present a measure for it. The number of learnt people 
seems to be a good quantity for this purpose. Al- 
ternatively, I need a measure to quantify the time in 
the game and a good choice for time is the number of 
games. In each society, each game takes some time 
to finish and the average of this time may be cal- 
culated easily. Consequently each game represents a 
time scale. Now the number of learnt people in each 
game, represent the learning improvement up to that 
time. Using these two, I can easily define other needed 
quantities such as learning speed. So, my model is 
complete. 

From now on, for better and easier understanding, 
I fix the following notations: 

• g Number of finished games 

• w Number of learnt people 

• N Number of people in the society 

• L Number of relationship between peopled E |) 

• D Average personal learning threshold 

• di Number of times that the i th person has 
heard the term. 

• ki Number of people who are related to i th 
person in the network or the degree of i th vertex 

• k Average number of friend each person may 
have 

I have studied this model with two approaches: 
first, analytical solution of learning improvement 
W(g). Second, simulation of model and studying the 
behavior of w versus g. Next section focuses on ana- 
lytical solution. 



III. ANALYTICAL SOLUTION 

In this section, I want to derive an analytical rela- 
tion between learning improvement w and spent time 
g. One of the key point in my job is that I make use 
of mean field approximation to do this. This means 
that the distribution of quantities over a randomness 
is represented by their average value. 

Here, I first calculate the learning improvement per 
game ^-, then I assume it to be ^p which is valid 
when the learning steps are increased in number and 



decreased in length. Consequently a differential equa- 
tion is obtained and solving the differential equation 
would result in an analytical solution of W(g). 

To calculate the learning improvement per game I 
first estimate the number of people who hear the term 
in a game, n game and then I can obtain the improve- 
ment in learning. To estimate the average number 
of people called in a game, n gamei I follow the game 
steps. In each game at least one person, Neo, is called. 
Then depending on NEO's state, different cases are 
possible. Neo may be in: 

1. Standby state:Neo has not learnt the term yet 
and does not learn in this game either. So he 
can not tell the term to his friends. 

2. Transition state: Neo has just heard the term D- 
I times so he would pass his learning threshold 
in this game and would be able to tell the term 
to his friends from now on. 

3. Active state: NEO has learnt the term in previ- 
ous games, so his knowledge does not improves, 
but he can tell the term to his friends. 

Number of learnt persons differs in each case. 
So I calculate the average number of people who 
learn the term in each game. To do so I 
need to calculate the probability of occurrence of 
each case p standby .PtransitiomPactive and the num- 
ber of people who learn the term in each case 
n s tandby,n tr ansition,n active . Then the average num- 
ber of people who learn term in a game, n game can be 
calculate as : 



Tlgame — P standby * ^standby 

i Ptransition * ^transition > P 'active * ^activ^J 

So I need just to calculate the pi and rn sets where 
i is a possible case. 

On the first step I calculate the pi set. I can make 
use of the total number of learnt people, W to guess 
the Neo's state. ^ indicates the probability of a per- 
son being learnt and therefore the probability of Neo 
being in active state. On the other hand, (I — ^) 
denotes the probability of Neo being in standby or 
transition state. As each person needs to hear the 
term D times to learn it, on first (D — I) times he 
would not pass the threshold and remains in standby 
state, but in D th time which is the transition state 
he joins the learnt persons. Thus, the chance of being 
in standby state is (D — l)'th times more than tran- 
sition state. As a result the probability of occurrence 
of these states arc obtained as: 
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The next step is calculating the rii set and for Neo's 
different states, they are easily obtained as : 

Standby: Only his own knowledge improves : 

^standby — -L 



Transition: As his knowledge improves he becomes 
able to call his friends:n tra „ S i t j „ = 1 + k 

Active: He can call his friends but his knowledge does 
not improves anymore: n ac ti ve = k 

Now according to Eq. ( Q]) I can calculate n game 



v^ W - 

l game : — / J 1H"i — A -\ — (K — A) (o) 



N 



where 



A 



d + k 
D 



(4) 



There is just one step left to calculate the learning 
improvement per game. The point is that among 
all those who have heard the term during the game, 
only the knowledge of those who have not learned the 
term, increases. In other words, if those who have 
heard the term are in standby or transition states their 
knowledge increases, so I have to multiply n game by 
Pstandby + Ptransition to obtain the average number 
of people whose knowledge improves. To determine 
the amount of improvement per each game, it suffices 
to determine the amount of improvement when each 
person is called. As explained person, needs to hear 
the term Dth times, so each time that he is called, 
his knowledge improves by -k. Finally the following 
differential equation is obtained: 



dW 
dg 



1 W - W 

- { A + -(k-A))(l--) 



again with: 



A:= 



D + k 
D 



Solving the differential equation leads to: 



W(g) = N- 






(^)e- 
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A 



(5) 



(G) 



(7) 



This equation ends the analytical solution. It is 
just left to check how good this relation describes the 
society learning. To do so, I developed and ran some 
simulation which is illustrated in the next section. 



IV. SIMULATION 

In this section I want to explain my simulation pro- 
cedure, some details about it and some of its features. 

First, I need to implement the society. To do so, I 
make use of a mathematical graph. Indeed, I produce 
a -M/v*./v matrix which is the adjacency matrix of the 
graph. Each row and column in this matrix represents 
a person in the society and person a is a friend of 
person b if M a .b = 1. 

To construct the adjacency matrices, I used the real 
network data of condensed matter collaborations col- 
lected by Mark Newman 13]. In addition, I made use 



of pajeck and ORA softwares to generate some ran- 
dom graphs such as scale free and small world and 
tested my model on generated graphs. 

Then, for each person i, I set a Di randomly be- 
tween 1 and 100 which means that people's learning 
threshold is between 1 and 100. 

Second, I use the adjacency matrix to implement 
the game. Each time a person is chosen randomly as 
Neo and his learning d^eo increases by one. Then if 
dNeo > DNeo, according to adjacency matrix Neo's 
friends are recognized and corresponding ds increases. 
Now the game is finished and a new game would start. 

I also defined an other parameter, w which increases 
each time that someone's learning passes his thresh- 
old. This parameter indicates the number of learnt 
people in the society. 

The game is repeated until a certain percent of so- 
ciety, a — W/N learn the term and the number of 
games is counted. Also, in this program, I calculate 
the number of past games for each a, 100 times and 
average over all the results. Consequently the number 
of games which is required to have a certain percent- 
age of the society learn the term, is obtained. 

I repeat this procedure for a = 0, .01, .02, ..., .98 to 
obtain the relation between a and number of games. 
Fig. ( HVI -b) presents my results for several networks, 
one real, one random, one scale free and one small 
world network. 

I used and analyzed the simulation results. Next 
section go through a comparison between simulation 
results and analytical solution. 



V. RESULTS 

In this section I first study the overall behavior of 
the mechanism according to both analytical solution 
and simulation results and then its dependence on 
model parameters such as number of edges and of ver- 
tices. I also check the results to be logical and match 
our expects. 



A. learning process 

The analytical solution describes the learning pro- 
cess as follows : at the beginning, the learning velocity 
is slow and only few people learn the term during first 
games. Then this velocity would speed up gradually 
till a specific percentage of the society learns the term 
and then it slows down again. Consequently, only 
few people learn during the last games. Fig. ( HV|) 
presents the learning process according to both ana- 
lytical solution and simulations for different network 
topologies. It is seen that simulation results confirm 
the analytical results for the learning process. 

In addition, Fig. ( IV A[) shows the learning velocity 
of the society versus time (g). It starts with zero, then 
speeds up to a maximum, and then slows down again 
for large values of g or large time and limits to zero. 

Next two sections illustrate the effect of network 
topology on learning process. 
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FIG. 2: a)learning behavior according to analytical solution for a network with N=500, and L=1500. b)learning behavior 
according to simulation result for different network topologies with N=500 and L=1500 
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FIG. 3: the speed of learning process versus number of 
past games for a network with N=500 and L=1500. 



FIG. 5: Learning process for networks with different num- 
ber of edges, L and fixed distribution function and N=500 




FIG. 4: Learning process for networks of different sizes, 
N and fixed distribution function and relationship size of 
L=1500 



As a result of both analytical solution and simu- 
lation results, societies with larger population need 
more time to be saturated which seems to be natural. 
Fig. ( IV B| presents the simulation results for learning 
process for societies of different sizes. It is seen that 
for larger N, the learning process graph shifts to larger 
time and saturates slower which means that for larger 
population, larger saturation time, g* is needed. 

On the other hand, according to Eq. ( [9|), the sat- 
uration time depends linearly on population size, N 
which means that analytical solution confirms the sim- 
ulation results. 



B. learning process and the size of 
population(N) 

Here I go through the effect of size of the popula- 
tion, N. To do so I work with the percent of society 
who have learned the term and games taken: 



W{g) = aN 



(8) 



I can calculate number of games taken for the desired 
percent according to Eq. ([7]). Solving Eq. (|SJ), I can 
find the game in which a percent of the society learns 
the term. I denote it as g* and it is obtained as: 



9 



D*NLog[ \^) a ) 



(9) 



From now on, I work with g* (a) as the saturation time 
for a desired amount of a — ^ . 



C. learning process and the number of 
relationship(L) 

The other result is about the effect of people rela- 
tionship, L on the learning process. Imagine a society 
in which most people know each other versus a society 
in which only few people are connected. In this section 
I describe how learning process differs for these two so- 
cieties. Again both simulation and analytical results 
are presented. Fig. ( IV C[) shows learning process for 
networks with different number of edges. These fig- 
ures, indicates that as people relationships increase 
the society learns the term sooner and the time scale 
of saturation decreases. 

On the other hand, the analytical solution predicts 
similar behaviors. Obviously, the k is proportional to 
number of edges L : 



k ~ N 



(10) 
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FIG. 6: analytical prediction for saturation time versus L 
for N=500 



Where s is a constant that depends on the distribu- 
tion function of degrees of vertices of the network and 
for a fixed distribution function it does not change. 
Replacing Eq. (jTUJ) in Eq. ( [5]) , the following relation 
is obtained: 



dN 2 Log 



9 



Ls(-l+a)+d(N(-l+a)-Lsa) -i 
(dN+Ls)(-l+a) J 

Ts 



(ii) 



This equation is plotted in Fig. ( IV C[> and describes 
the analytical behavior of saturation time g* versus L. 
According to this analytical result, when the size of 
relation between people, L increases, society learns the 
term faster and in fact the saturation time decreases 
which is compatible with both simulation result and 
our expectations. 



According to this job, the process in which a new 
term spreads out through a network is described as 
follows: At the beginning of the process, few people 
knows the term. Consequently, the role of people in- 
teraction in learning process is negligible in compas- 
sion with external force. But as time passes, the num- 
ber of learnt people increases and their contribution 
in learning process increases the process velocity. It 
means that the number of people who learn the term 
in a specific time increases. Finally when most of the 
society learn the term, as there are few learner people, 
the process velocity decreases and limits to zero. This 
description is illustrated in Fig. ( lIVp . 

Maybe the key feature of my job is that I have im- 
ported the topological parameter of the society net- 
work in my model. In other word, network topology 
is accounted in my job. For instance it were seen that 
when the number of people, N increases, the satura- 
tion time increases in turn or when the number of edge 
increases in network, the saturation time decreases. 

For the next step the relation between different 
terms in a language may be considered. For instance 
the interrealtion between terms may affect the society 
learning proccess and help people to learn it faster. 
Also the spread mechanism can be improved or other 
kind of diffusion mechanism maybe checked. 



VI. CONCLUSION 
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